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CONTENT ADDRESSABLE MEMORY HAVING DYNAMIC MATCH 

RESOLUTION 



Cross-reference to related application 

[0001] This application is a continuation-in-part, and claims benefit under 35 USC 
§120, of co-pending and commonly owned, U.S. Patent Application Serial No. 
10/1 12,630 filed March 29, 2002 and entitled "Content Addressable Memory Having 
Dynamic Match Resolution," which is incorporated by reference herein. 

Field of Invention 

[0002] The field of invention relates generally to the semiconductor arts; and, more 
specifically, to a method and apparatus for a content addressable memory (CAM) having 
dynamic match resolution. 

Background 

[0003] Content addressable memories (CAMs) can be used within a wealth of 
different electronic systems (e.g., computing systems such as laptop computers, personal 
computers (PCs), servers, etc.; networking systems such as routers, switches, base 
stations, etc.; etc.,). Often, a CAM is used to perform a "look-up" fimction. For 
example, in networking applications, a portion of the header information of a packet is 
often used as a search key or comparand to look-up or identify an entry in the CAM. The 
address at which the matching entry is located in the CAM can then be used to address or 
index a location in associated memory where attributes as to how the packet is to be 
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handled are found (e.g., an attribute that identifies the priority level of the packet, an 
attribute that identifies the specific fiber-optic cable over which the packet is to be 
launched, etc.). 

[00041 Figure 1 shows an example of at least a portion 1 00 of a CAM architecture. 
According to the depiction of Figure 1. CAM architecture 100 includes a plurality ("K") 
of rows of CAM cells 140i through 140k. Each CAM cell includes a memory storage 
element and a compare circuit and can be binary (i.e., effectively stores and compares on 
two logic states - logic 0 and logic 1) or ternary (i.e., effectively stores and compares on 
three logic states - logic 0, logic 1, and a don't care state). 
[00051 A comparand is typically registered in comparand register 1 08 and then 
compared with the data values stored within the plurality of CAM cells 140. The 
comparison resuhs are reflected as match signals on corresponding match signal lines 
1 lOi-l 10k. The match signals are typically then latched by latch 102. Latch 102 has K 
latch circuits 102i-102k that each latch a corresponding match signal state in response to 
the latch enable signal generated by timing generator 103. The latch enable signal is 
typically generated in response to a cycle clock and control signal CMP that indicates 
when a compare operation is taking place. 

[00061 The latched match signals are provided to priority encoder 1 05 and flag logic 
1 13 over signal lines 104i-104k. Priority encoder 105 processes the match signals in a 
predetermined manner to determine which of the matching entries (i.e., those data values 
in the CAM array that match the comparand) is the highest priority matching entry. An 
index or address of the highest priority matching entry m the CAM array is then output 
fi-om priority encoder 105 over signal lines 106. Flag logic 1 13 typically generates a 
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match flag signal that indicates if one or more of the match signals indicates a match, and 
thus qualifies the output of the priority encoder. 

[0007] Semiconductor integrated circuit (IC) designers typically specify minimum 
performance levels that the IC will meet or exceed across a wide range of operating 
conditions of a CAM device. The operating conditions include environmental conditions 
(e.g., temperature, moisture content in the operating environment, pressure, etc.), 
electrical conditions (e.g., supply voltage, input and output voltages, currents and 
loading, etc.), manufacturing variations (e.g., fabrication variations in transistor 
geometries and properties, material geometries and properties, etc., which are commonly 
referred to as "process variations"), and fimctional considerations (e.g., which operations 
have been performed by the device and which operations are currently being performed 
by the device, etc.). 

With respect to CAM architecture 100, timing generator 103 is typically designed using 
simulation tools to transition the latch enable signal at a point in time when all of the 
match signals are stable after a compare operation. As such, the timing generator is 
typically designed to transition the latch enable signal to capture accurate match results 
even under worst case operating conditions. Worst case operating conditions are typically 
high operating temperature, low supply voltage, slow fabrication geometries, and the 
slowest time required for CAM array 140 to perform a match and/or a mis-match 
operation. Although this design approach may guarantee a minimum performance under 
worst case operating conditions, it sacrifices better performance that could have been 
realized under better than worst case operating conditions. This can adversely affect the 
rated performance of the CAM device and can decrease the overall operating frequency. 
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speed or throughput of the device. For example, the longer time that is required to latch 
the match signals, the later in a particular clock cycle (e.g., of the cycle clock) that it 
takes for the priority encoder and flag logic to perform their respective operations. This 
slows down the maximum operating speed of the device. 

[00081 Additionally, timing generator 103 is typically designed as a pulse generator 
(e.g., a one-shot) and/or a delay circuit (e.g., one or more inverting or non-inverting 
buffer circuits or resistor-capacitor (RC) circuits). Since these circuits are designed 
differently than CAM array 140 (i.e., do not include CAM cells), as operating conditions 
change the latch enable signal output by timing generator 103 typically does not track in 
time or voltage proportionally with changes to the match results provided on match lines 
1 10. This variation is further simulated and accounted for during design, and typically 
results m additional delay added to the point in time at which the latch enable signal is 
designed to transition. 
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Figures 

[0009] The present invention is illustrated by way of example, and not limitation, in 

the Figures of the accompanying drawings in which: 

[0010] Figure 1 shows an example of a portion of a CAM architecture. 

[0011] Figure 2 shows an embodiment of a portion of a CAM architecture including 

a dynamic timing generator. 

[0012] Figure 3 shows one embodiment of a row of CAM cells. 

[0013] Figure 4 shows an exemplary timing diagram of the operation of the row of 

CAM cells of Figure 3. 

[0014] Figure 5 shows an embodiment of a binary CAM cell. 

[0015] Figure 6 shows another embodiment of a row of CAM cells. 

[0016] Figure 7 shows an exemplary timing diagram of the operation of the row of 

CAM cells of Figure 6. 

[0017] Figure 8 shows an exemplary timing diagram of the dynamic operation of the 
dynamic timing generator of Figure 2. 

[0018] Figure 9 illustrates the operation of the CAM architecture of Figure 2. 
[0019] Figure 10 shows an embodiment of the dynamic timing generator of Figure 
2. 

[0020] Figure 11 shows an embodiment of the match or mis-match comparison 
timing circuits of Figure 10. 

[0021] Figure 12 shows an embodiment of the operation of the comparison timing 
circuit of Figure 11. 
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[0022] Figure 13 shows another embodiment of the match or mis-match comparison 
timing circuits of Figure 10. 

[0023] Figure 14 shows another embodiment of the match or mis-match comparison 
timing circuits of Figure 10. 

[0024] Figure 15 shows another embodiment of the match or mis-match comparison 
tmung circuits of Figure 10. 

[0025] Figure 16 shows an embodiment of a CAM device that includes the CAM 
architecture of Figure 2. 

[0026] Figure 17 shows an embodiment of an appUcation that includes the CAM 
device of Figure 16. 

[0027] Figure 18 shows a more detailed configuration of one embodiment of the 
CAM architecture of Figure 2. 

[0028] Figure 19 shows another embodiment of a CAM architecture. 

[0029] Figure 20 shows one embodiment of a load element. 

[0030] Figure 21 sh9ws another embodiment of the CAM architecture of Figure 19. 

[0031] Figure 22 shows another embodiment of a CAM architecture. 

[0032] Like reference numerals refer to corresponding parts throughout the drawing 

figures. 
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Description 

[0033] In the following description, for purposes of explanation, specific 
nomenclatiire is set forth to provide a thorough understanding of the present invention. 
However, it will be apparent to one skilled in ttie art that these specific details may not be 
required to practice the present invention. In some instances, the interconnection between 
circuit elements or blocks may be shown as buses or as single signal lines. Each of the 
buses may alternatively be single signal Unes, and each of tiie single signal Unes may 
alternatively be buses. A signal is said to be "asserted" when tiie signal is driven to a low 
or high logic state (or charged to a high logic state or discharged to a low logic state) to 
indicate a particular condition. Conversely, a signal is said to be "deasserted" to indicate 
that the signal is driven (or charged or discharged) to a state otiier tiian the asserted state 
(including a high or low logic state, or die floating state that may occur when the signal 
driving circuit is transitioned to a high impedance condition, such as an open drain or 
open collector condition). A signal driving circuit is said to "output" a signal to a signal 
receiving circuit when tiie signal driving circuit asserts (or deasserts, if explicitly stated or 
indicated by context) tiie signal on a signal line coupled between tiie signal driving and 
signal receiving circuits. A signal line is said to be "activated" when a signal is asserted 
on tiie signal line, and "deactivated" when the signal is deasserted. Additionally, the 
prefix symbol "/" attached to signal names indicates tiiat tiie signal is an active low signal 
(i.e., tiie asserted state is a logic low state). A line over a signal name (e.g., 
' < signal name > ') is also used to indicate an active low signal. Active low signals may 
be changed to active high signals and vice-versa as is generally known in tiie art. 
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Additionally, the term "coupled" as used herein refers to the direct or indirect connection 
through one or more intervening circuits. 

[0034] A CAM architecture is disclosed that dynamically adjusts its match resolution 
timing in response to the applicable operating conditions under which the CAM 
architecture operates such that the CAM architecture can exhibit improved operating 
perfonnance (e.g., operating speed, throughput, etc.) during a compare operation relative 
to a CAM architecture designed according to traditional approaches. The CAM 
architecture includes a dynamic timing generator that performs one or more compare 
operations to parallel the operation of a CAM array. Thus, the CAM architecture employs 
the dynamic timing generator to "self-time" capture of match results from the CAM 
array. The dynamic timing generator outputs an enable signal to capture the match results 
from the CAM airay, and dynamically adjusts the output of the enable signal as operating 
conditions change (e.g., a change in temperature or a degradation in transistor gain). 
[00351 For example, if operating conditions improve such that faster compare 
operations are reaUzed by the CAM array, the dynamic timing generator adjusts its enable 
signal such that match results from the CAM array are captured earUer in time than under 
relatively slower operating conditions. Likewise, as operating conditions degrade such 
that slower compare operations are realized by the CAM array, the dynamic timing 
generator adjusts its enable signal such that match results from the CAM array are 
captured later in time than under relatively faster operating conditions. As such, the 
match results can be captured as soon as they are available from the CAM array. 
10036] Figure 2 shows an embodiment of at least a portion of a CAM architecture 
200 having dynamically adjustable match resolution as described just above. CAM 
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architecture 200 includes comparand register 208, CAM array 240, latch 202, priority 
encoder 205, flag logic 213, and dynamic timing generator 220. The CAM architecture 
may include other circuits (e.g., as shown in Figure 16) including, for example, a control 
circuit or instruction decoder that decodes instructions (e.g., compare instructions) and 
provides control signals to one or more of the circuit blocks, read circuitry and write 
circuitry for the CAM array, address circuitry, configurability circuits to configure the 
width and/or depth of the CAM array, error detection and/or correction circuitry, and the 
like. 

CAM array 240 includes K of rows of CAM cells 240i-240k. Each CAM cell includes at 
least one memory storage element and at least one compare circuit and can be binary (i.e., 
effectively stores and compares on two logic states - logic 0 and logic 1) or ternary (i.e., 
effectively stores and compares on three logic states - logic 0, logic 1, and a don't care 
state). A comparand is typically registered in comparand register 208 and then compared 
with the data values stored within the plurality of CAM cells 240. Comparand register 
208 is optional. The comparison results are reflected as match signals on corresponding 
match signal lines 210i-210k. The match signals are typically then latched by latch 202. 
Latch 202 has K latch circuits 202i-202k that each latch a corresponding match signal 
state in response to the latch enable signal 203 generated by dynamic timing generator 
220. The latch enable signal 203 is typically generated in response to a cycle clock, and 
may also be generated in response to a control signal that indicates when a compare 
operation is taking place. The control signal may be generated, for example, by a control 
circuit or instruction decoder that detects when a compare operation is to take place on 
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the CAM array. Latch 202 may also be any type of timed or clocked storage circuit 
including, for example, a register (e.g., a flip-flop) or other timed memory circuit. 
[0037] The latched match signals are provided to priority encoder 205 and flag logic 
213 over signal lines 204i-204k. Priority encoder 205 processes the match signals in a 
predetermined manner to determine which of the matching entries (i.e., those data values 
in the CAM array that match the comparand) is the highest priority matching entry. An 
index or address of the highest priority matching entry in the CAM array is in then output 
from priority encoder 205 over signal lines 206. For other embodiments, the priority 
encoder may be an encoder and not a priority encoder. 

[0038] Flag logic 213 generates a match flag signal that indicates if one or more of 
the match signals indicates a match, and thus qualifies the output of the priority encoder. 
The flag logic may also indicate the fill state of the CAM array by, for example, 
outputting a full flag signal, an empty signal or a partially full flag signal (e.g., by 
monitoring one or more vaUd bits each associated with an entry in the CAM array). 
[0039] Figure 3 shows a row of CAM cells 340 that is one example of a row of CAM 
cells 240 of Figure 2. Row 340 includes pre-charge circuit 320 and N CAM cells 330. 
Pre-charge circuit 320 is any type of pre-charge circuit that charges match line 310 
towards the power supply rail VDD. Each of the N CAM cells may be binary, ternary or 
other types of CAM cells, that each include at least one memory storage element for 
storing data and at least one compare circuit for comparing at least bit of the comparand 
with the stored bit in the CAM cell. Row 340 will be generally referred to hereafter as 
"NOR-based" because if any compare circuit within a particular CAM cell determines 
that its stored data bit does not match a corresponding bit in the comparand, then the 
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match line is discharged by the CAM cell towards ground. Only if all of the CAM cells 
store data that matches corresponding bits in the comparand will the match line remain in 
a pre-charged state. 

[0040] This operation is further shown in Figure 4 in which from time TO to time Tl 
there is a match state. At time Tl there is at least one CAM cell that stores a bit that mis- 
matches a corresponding bit in the comparand, and that CAM cell starts to discharge the 
match line towards ground. At time T2, the voltage on match line 310 crosses VTH, 
which is the voltage that delineates a match from a mis-match state on the match line. At 
time T3, a match condition is again detennined. At tune T4 the match condition is 
represented on match line 310. 

[00411 Figure 5 shows binary CAM cell 530 that is one embodiment of any of CAM 
cells 330 of Figure 3. CAM cell 530 includes a memory cell 540 and a compare circuit 
550. Memory cell 540 may be any type of memory cell including a SRAM cell, DRAM, 
cell or any other type of volatile or non-volatile memory cell. As shown, both the data bit 
and its logical complement stored in memory cell 540 are provided to respective gates of 
transistor Q4 and Q2. Transistor Q2 is coupled in series with transistor Ql between the 
match line and ground. Transistor Ql has its gate coupled to a bit of comparand data C. 
Similarly, transistor Q4 is coupled in series with transistor Q3 between the match line and 
ground. Transistor Q4 has its gate coupled to receive the logical complement of the 
comparand bit. As shown in Figure 5. if the stored data and the comparand are the same 
logical state (i.e., match), then no path to ground is established for the match line. If, 
however, they do not have the same logical state (i.e., do not match), then a discharge 
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path to ground exists for the match line. Other embodiments of compare circuit may be 
used. 

[00421 Figure 6 shows a row of CAM cells 640 that is another embodiment of 

any of the row of CAM cells of Figure 2. Row 640 includes pre-charge circuit 320, N 
CAM cells 630r630N, and N pass transistors Ql-QN. Each of the N CAM cells 630i- 
630n maybe binary, ternary or other types of CAM cells, that each include at least one 
memory storage element for storing data and at least one compare circuit for comparing 
at least one bit of the comparand with the stored bit in the CAM cell. Row 640 will be 
referred to generally hereafter as "NAND-based" because the match line is discharged 
toward ground only if all of the compare circuits within the CAM cells determine that its 
stored data matches the comparand. When a CAM cell stores data that matches a 
corresponding bit in the comparand, it sends a signal to the gate of its corresponding pass 
transistor to turn on the transistor. If any one of the CAM cells stores data that does not 
match a corresponding bit in the comparand, the corresponding pass transistor is turned 
off and the match line remains pre-charged towards VDD. 

[0043] This operation is further shown in Figure 7 in which from time TO to time Tl 
there is a mis-match state. At time Tl all of stored bits match the comparand such that aU 
of the pass transistors are enabled to discharge the match lines towards ground. At time 
T2, the voltage on match line 310 crosses VTH, which is the voltage that delineates a 
match from a mis-match state on the match line. At time T3, a mis-match condition is 
again determined. At time T4 the mis-match condition is represented on match line 610. 
[00441 Returning again to Figure 2, dynamic timing generator 220 transitions the 
latch enable signal to cause latch 202 to capture the match results when all of the match 
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signals accurately reflect the current compare results. In contrast to timing generator 103 
of Figure 1, dynamic timing generator 220 performs one or more compare operations to 
parallel the operation of CAM array 240. Thus, as the generation of the match signals is 
increased or decreased in time by the operation of CAM array 240 under various 
operating conditions, the dynamic timing generator similarly and proportionally 
dynamically adjusts the time at which it transitions the latch enable signal. As such, CAM 
architecture 200 "self-times" when to capture the match results enabling them to be 
captured as soon as they are stable on the match lines. 

[00451 For example, because dynamic timing generator 220 performs a parallel 
comparison operation as that performed by CAM array 240, if operating conditions 
improve such that faster compare operations are reaUzed by the CAM array, the dynamic 
timing generator adjusts the transition of the latch enable signal by a substantially 
proportional amount such that match results from the CAM array are captured earlier in 
time than under relatively slower operating conditions. This may allow the CAM 
architecture to run at higher frequencies under faster operating conditions. Likewise, as 
operating conditions degrade such that slower compare operations are realized by the 
CAM array, the dynamic timing generator adjusts the transition of the latch enable signal 
by a substantially proportional amount such that match results from the CAM array are 
captured later in time than under relatively faster operating conditions. 
[0046] For example, as shown in Figure 8, if a new comparand begins to be analyzed 
on the rising edge of a cycle clock signal 207, and the match results are latched on the 
rising edge of the latch enable signal, then using the rising edges 81 1 and 813 of the cycle 
clock 207, respectively, as a reference point for both latch times 812 and 814, note that 
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the first latch time 812 is observed after time period Tx for a first compare cycle, and the 
second latch time 814 is observed as time period Ty for a second compare cycle. Note 
that time period Ty is longer in time than time period Tx. The second compare cycle 
represents operating conditions that cause CAM architecture 200 to operate slower (i.e., 
generate accurate match results slower) than in the first compare cycle. This difference 
in time periods demonstrates that the latch time of the CAM architecture can be 
dynamically adjusted. Furthermore, in at least one embodiment, a new latch time can be 
estabUshed with each new comparand. As such, the CAM approach of Figure 2 is able 
to vary the latch timing on a comparand-by-comparand basis. 
[00471 By having dynamic timing generator 220 perform a substantially similar 
compare operation as that performed by CAM array 240, the dynamic timing generator 
can be designed to transition the latch enable signal at a time that more quickly (i.e., as 
opposed to conventional simulation designs) indicates when the match signals accurately 
reflect the match results from the CAM array for a given compare operation. This 
advantageously improves the speed at which the match results are latched and when the 
priority encoder can output a matching index, and when the flag logic can output a match 
result. Consequently, this can improve the operating frequency, speed and throughput of 
the CAM architecture over conventional designs. 

[00481 Figure 9 illustrates one embodiment of the operation dynamic timing 
generator 220 and latch 202. For this embodiment, 900 and 901 are performed by 
dynamic timing generator 220, and 902 is performed by latch 202. The amount of time 
that the CAM array 204 uses to determine its match results on match will be referred to 
hereafter as a "comparison time", a "compare time" or the Uke. 
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[00491 At 900, while a comparison operation is being perfomed by the CAM array, 
dynamic timing generator 220 determmes the longest or worst case comparison time that 
the CAM array should experience for the given operating conditions. For one example, 
dynamic timing generator 220 determines the longest comparison time by performing its 
own parallel comparison operations (with the supplied comparand or with a 
predetermined, pre-stored comparand) to mirror the operation of the CAM array. The 
parallel comparison operations may include comparison operations that generate both 
match and mis-match results so as to mirror the slowest functional operation that any row 
of the CAM array could experience in a given compare operation. 
[OOSO] Once dynamic timing generator 220 determines the longest comparison time, 
the dynamic timing generator transitions the latch enable signal at 901 such that the 
match results on match signal lines 210 are captured by latch 202 (at 902) at or beyond 
the just determined longest comparison time. As a result, the match results are latched as 
soon as practicable under the applicable operating conditions. This can avoid idling for 
significant amounts of time prior to latching match results from the CAM array once they 
are ready to be latched. 

[0051] Note that the process of Figure 9 can be continually repeated so that an 
appropriate latch time is continually re-determined over the course of operation of the 
CAM architecture. As such, as described above, the CAM can dynamically adjust to 
changes in operating conditions that arise during its operation. In one embodiment, the 
process is continually repeated to generate a new latch enable signal in every clock cycle. 
Alternatively, a new latch enable signal may be generated at a frequency slower than 
every clock cycle. 
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10052] It is important to point out that numerous embodiments may differ with 
respect to the manner in which dynamic timing generator 220 estabhshes the latch timing 
for latch 202. For example, according to at least one approach, a plurality of comparison 
time measurements are undertaken in order to monitor a plurality of circumstances that 
could noticeably affect the compare operation of CAM array 240. 
[0053] Figure 10 illustrates dynamic timing generator 1 020 that is one embodiment 
of dynamic timing generator 220 of Figure 2. Dynamic timing generator 1020 includes 
match comparison timing circuit 1001, mis-match comparison timing circuit 1006, and 
select circuit 1005. Each of the comparison timing circuits 1001 and 1006 performs 
compare operations to mirror or parallel the similar worst-case match and mis-match 
comparison operations performed by CAM array 240. Various timing circuit designs may 
be used to perform the parallel comparison operations including using one or more 
duplicate rows of CAM cells such as those used in the CAM array. 
[0054] Each comparison timing circuit 1001 and 1006 outputs a comparison result 
signal at respective outputs 1002 and 1004. The comparison result signals are indicative 
of the time required to complete the corresponding match or mis-match operation. Select 
circuit 1005 selects the slower of the comparison result signals to determine when to 
transition the latch enable signal 203. That is, select circuit 1005 effectively recognizes 
which of the match and mis-match comparison times performed by timing circuits 1001 
and 1006 represents the slowest comparison time. This slowest comparison time is also 
indicative of the slowest possible comparison time that any given row of the CAM array 
may take to resolve its own match results. 
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[0055] Dynamic timing generator 1020 uses comparison timing circuits 1001 and 
1006 to mirror the two operations and results (i.e., match and mis-match compare 
operations) that a given row of CAM cells in the CAM may experience during a compare 
operation. For one embodiment, match comparison timing circuit 1001 includes a row of 
CAM cells or compare circuits (manufactured in a similar manner to, and including 
substantially similar circuit components to, those in the CAM array) that performs a 
compare operation with a pre-stored comparand such that the comparison always 
generates a match (e.g., as a match signal on output 1002). Similarly, mis-match 
comparison timing generator circuit 1006 may include a row of CAM cells or compare 
circuits (manufactured in a similar manner as to, and including substantially similar 
circuit components to, those in the CAM array) that performs a compare operation with a 
pre-stored comparand such that the comparison always generates a mis-match (e.g., as a 
signal on output 1002). Select circuit 1005 compares or combines the timing or logic 
states to select the slowest results reflected on lines 1002 and 1004. 
[0056] For one embodiment, each of the comparison result signals is an active high 
gnal that is asserted to the active high state when the respective timing circuit has 
impleted its compare fimction(s). For this embodiment, select circuit 1005 maybe a 
logic AND gate. 

[0057] Two compare timing circuits 1001 and 1006 are included in dynamic timing 
generator 1020 because either a match or mis-match comparison result for a given row of 
CAM cells in the CAM array may take longer. If, either a match or mis-match will 
always take longer, than only one such comparison timing generator circuit need be 
included in dynamic timing circuit 1020. Alternatively, if a row of CAM cells in the 
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CAM array can have more than one state, then additional timing circuits may be included 
in dynamic timing generator 1020, one for each possible output state of the rows of CAM 
cells of the CAM array. 

[00581 Note that match comparison timing circuit 1001 can determine the worst-case 
time that it would take for a match comparison operation to take place in any row of 
CAM array 240. This may be accompUshed, for example, by performmg a series of 
comparison operations to precondition output line 1002 to a particular state before 
generating a match condition. Exemplary embodiments will be described below. 
Similarly, mis-match comparison timing circuit 1006 can determine the worst-case time 
that it would take for a mis-match comparison operation to take place in any row of CAM 
array 240. This may be accompUshed, for example, by performing a series of comparison 
operations to precondition output line 1004 to a particular state before generating a mis- 
match condition. Exemplary embodiments will be described below. 
[0059] Thus, dynamic timing generator 1020 may be tailored so as to test against 
both a worst case match comparison condition and a worst case mis-match comparison 
condition, hi this sense, the design approach of Figure 10 is comprehensive because a 
plurality of potential worst case circumstances are individually measured or determined, 
and the worst result across the plurality of worst case circumstances is used to set the 
CAM'S latch time. As such, the probability of a race condition occurring is low because 
the latch time is set at or beyond what is believed to be the slowest possible CAM 
comparison time. 

[0060] Note that since timing circuits 1001 and 1006 perform the same functions as 
the rows of CAM cells in CAM array 204, that as operating conditions change the 
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comparison results generated by timing circuits 1001 and 1006 adjust in a similar manner 
(e.g., by a similar or proportional amount of time, voltage, etc.) as the match results 
generated by CAM array 240. When the timing circuits 1001 and 1006 are formed using 
similar circuits and geometries (e.g., also use rows of similarly designed CAM cells or 
compare circuits to those used in CAM array 240), changes in operating conditions 
substantially affect both the CAM array and the timing circuits in substantially the same 
way, thus allowing the benefits of on-chip latch time determination to dynamically track 
with changes in operating conditions. 

[0061] Figure 11 illustrates comparison timing circuit 1 100 that is one 

embodiment of the timing circuits 1001 and 1006 of Hgure 10. Comparison timing 
circuit 1 100 includes storage circuit 1 102, compare circuit 1 104, select circuit 1 106, and 
sense circuit 1 108. Compare circuit 1 104 compares the data stored in storage circuit 1 102 
with either comparand A (CMPDA) or comparand B (CMPDB) provided by select circuit 
1 106. CMPDA and CMPDB may be stored in storage circuits or may be provided 
external to the CAM architecture. 

[0062] Select circuit 1 1 06 determines which comparand to supply to compare circuit 
11 04 in response to the comparand select signal CMPDSEL. CMPDSEL may be 
generated, for example, by a control circuit or instruction decoder, a timing circuit, or 
may be free running. Select circuit provides its output to the compare circuit also in 
response to the cycle clock. For one embodiment, select circuit 1 106 is a multiplexer. The 
result of the comparison operation performed by compare circuit 1 104 is provided on 
signal line 1 1 10 and sensed by sensing circuit 1 108. Sensing circuit 1 108 senses the 
jltage change of the comparison resuU on signal line 1 1 10 and detects when the voltage 
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has reached one or more thresholds. Sensing circuit 1 108 then asserts or deasserts a 
signal on signal line 1 1 12. Sensing circuit 1 108 may be any type of sensing circuit 
including a single or differential sense amplifier, a logic gate (e.g., an inverter, AND gate, 
OR gate, etc.) and the like. The sensed output is provided to signal line 1112 that 
corresponds to comparison result outputs 1002 and 1004 of Figure 10. 
[0063] For one embodiment, storage circuit 1 102 and compare circuit 1 104 form a 
row of CAM cells (e.g., such as that shown in Figures 3 or 6), and signal line 1 1 10 
corresponds to a match line. To mirror and parallel the operation of the rows of CAM 
cells in the CAM array 240 of Figure 2, the row of CAM cells formed by storage circuit 
1 102 and compare circuit 1 104 are manufactured in a similar manner to, and include 
substantially similar circuit components to, those in the CAM array. 
[0064] Note that some form of deliberate additional delay may be added to the output 
of comparison timing circuit 1 100 (e.g., via the propagation of one or more logic gates) to 
ensure that a latch time which is derived from the result is beyond the comparison times 
of the individual rows of CAM cells in CAM array 240. By so doing, race conditions can 
be avoided. 

[00651 For alternative embodiments, data stored in storage circuit 1102 may be hard- 
wired to particular logic states (i.e., not stored in memory circuits such as those found in 
CAM array 240) to reduce the size of storage circuit. This can be accomplished while still 
maintaining substantially similar responses between the comparison timing circuit 1 100 
and the rows of CAM cells in CAM array 240 during compare operations across varying 
operating conditions. 
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[0066] In still other embodiments, storage circuit 1 102 and compare circuit 1 104 may 
be replaced by a circuit whose timing characteristics (e.g., its RC time constant) are 
designed to correlate to the comparison timing characteristics of tiie rows of CAM cells 
in CAM array 240 (i.e., substantially tiracks tiie voltage changes on tiie match lines of the 
CAM array in response to changes in operating conditions). Because of the design for 
correlation, tiie output from such a comparison timing circuit can still be used to set tiie 
CAM'S latch time even if a comparison is not actiially performed by tiie timing circuit. 
Again, tiiose of ordinary skill will be able to determine tiie appropriate correlation and/or 
any additional delay to avoid race conditions. 

[00671 The operation of comparison timing circuit 1 1 00 is furtiier illustirated in 
Figure 12. At 1202, CMPDSEL is in a first state such tiiat select circuit 1 106 provides 
CMPDA to compare circuit 1 104. Compare circuit 1 104 compares CMPDA witii tiie data 
stored in storage circuit 1 102 as many times as is required to pre-condition tiie signal line 
1 1 10 to a first state. Subsequently, at 1204, CPMDSEL transitions to a second state such 
that select circuit 1106 provides CMPDB to compare circuit 1106. Compare circuit 1106 
compares CMPDB witii tiie data stored in storage circuit 1 102, and sense circuit 1 108 
senses when tiie voltage on signal hne 1 1 10 has reached one or more tiiresholds. 
[00681 Consider when comparison timing circuit 1 1 00 is operating as a match 
comparison timing circuit such as 1001 of Figure 10. Comparison timing circuit 1 100 
determines tiie worst-case (i.e., slowest) comparison time of any row of CAM cells in 
CAM array 240 to determine a match condition under current operating conditions by 
having CMPDA set to mis-match tiie data stored in storage circuit 1 102 and CMPDB set 
to match tiie data stored in storage circuit 1 102. For example, when CMPDA mis- 
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matches the data stored in storage circuit 1 102 (at 1202), the voltage on signal line 1 1 10 
will be discharged towards ground potential (for a NOR-based row of CAM cells). This 
may take one or more clock cycles to discharge signal line 11 10 as close to ground 
potential as possible, but can be more quickly achieved by having CMPDA mis-match all 
bits of the data stored in storage circuit 1 102. Then, CMPDB is provided at 1204 and 
sensing circuit 1 108 senses when the signal on signal line 1 1 10 is equal to or greater than 
VTH (e.g., as shown in Figure 4). Note that this example has been discussed with 
reference to NOR-based CAM cell operation, but can be readily adapted to NAND-based 
CAM cell operation. 

[00691 Now consider when comparison timing circuit 1 100 is operating as a mis- 
match comparison timing circuit such as 1006 of Figure 10. Comparison timing circuit 
1 100 determines the worst-case (i.e., slowest) comparison time of any row of CAM cells 
in CAM array 240 to determine a mis-match condition under current operating conditions 
by having CMPDA set to match the data stored in storage circuit 1 102 and CMPDB set to 
mis-match the data stored in storage circuit 1 102. For example, when CMPDA matches 
the data stored in storage circuit 1 102 (at 1202), the voltage on signal line 1 1 10 will be 
charged towards the power supply rail VDD (for a NOR-based row of CAM cells). This 
may take one or more clock cycles to charge signal line 1 1 10 as close to VDD as 
possible. Then, CMPDB is provided at 1204 and sensing circuit 1108 senses when the 
signal on signal line 1 1 10 is equal to or less than VTH (e.g., as shown in Figure 4). The 
worst case mis-match pattern (i.e., the one that will cause the signal on signal line 1 1 10 to 
take the longest time to discharge towards VTH and ground) is when only a single-bit of 
CMPDB mis-matches a corresponding single bit in the data stored in storage circuit 1002 
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because only a single CAM cell will be discharging signal line 1110. So, the worst-case 
mis-match pattern may be used to more closely parallel worst-case mis-match operations 
that may take place in any row of CAM cells in CAM array 240. Note that this example 
has been discussed with reference to NOR-based CAM cell operation, but can be readily 
adapted to NAND-based CAM cell operation. 

[0070] Note that comparison timing circuit 1 100 of Figure 1 1 generates a compare 
results on signal line 1 1 10 and 1 1 12 every cycle clock. However, it generates a worst- 
case match or mis-match compare result at a frequency less than every cycle clock. 
Figure 13 illustrates a comparison timing circuit 1300 that is another embodiment of 
either of timing circuits 1001 or 1006 of Figure 10. Comparison timing circuit 1300 
includes mterleaver circuit 1302 and M comparison timing circuits 1 100 as shown in 
Figure 11, where M is any number. Each of the comparison timing circuits generates a 
worst-case match or mis-match comparison result as described above, except that the 
CMPDSEL signal for the respective comparison timing circuits 1 100 are each set to a 
logic one state once every M clock cycles, and each CMPDSEL is set to a logic one state 
in a different clock cycle such that for any given cycle of the cycle clock, one of the 
circuits 1000 is outputting a worst-case match or mis-match comparison result 
(depending on whether it is configured to operate as 1001 or 1006 of Figure 10). 
hiterleaver circuit 1302 combines the outputs of the timing circuits 1100, and provides 
the combined output to signal line 1304 that corresponds to outputs 1002 or 1004 of 
Figure 10. For one embodiment, interleaver circuit 1302 is an OR logic gate. For other 
embodiments, interleaver circuit 1302 may be one or more other logic gates or circuits. 
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[00711 Figure 14 illustrates the operation of comparison timing generator 1 300 of 
Figure 13 using two comparison timing generator circuits 1 100. Consider, for this 
example, that comparison timing circuit 1300 operates as match comparison timing 
generator circuit 1001 of Figure 10. In a first cycle of the cycle clock, CMPDSELi is set 
to a logic one state to select CMPDA which has all of its bits set to mis-match the data 
stored in each of the storage circuits in the timing circuits, and CMPDSEL2 is set to a 
logic zero state to select CMPDB which has all of its bits set to match the data stored 
each of the storage circuits in the timing circuits. In the following second cycle of the 
cycle clock, CMPDSELi is set to a logic zero state to select CMPDB and CMPDSEL2 is 
set to a logic one state to select CMPDA, and the sense circuit of timing circuit 1 100, 
asserts the signal on signal line 1 1 lOi when a match is detected (and deasserts the signal 
on signal line 1 llOi before the next match result is determined by 1 IOO2). In the 
following third cycle of the cycle clock, CMPDSELi is again set to a logic one state to 
select CMPDA, and CMPDSEL2 is again set to a logic one state to select CMPDB, and 
the sense circuit of timing circuit 1 IOO2 asserts the signal on signal line 1 1 IO2 when a 
match is detected (and deasserts the signal on signal line 1 1 IO2 before the next match 
result is determined by 1 lOOi). In this manner, comparison timing circuit 1300 of Figure 
14 outputs a worst-case match comparison timing signal on signal line 1 304 every cycle 
of the cycle clock for use in latching match results from CAM array 240 of Figure 2. 
[0072] Similarly, consider another example in which comparison timing circuit 1 300 
of Figure 13 operates as match comparison timing generator circuit 1006 of Figure 10. 
In a first cycle of the cycle clock, CMPDSELi is set to a logic one state to select CMPDA 
which has all of its bits set to match the data stored in each of the storage circuits in the 
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timing circuits 1 100, and CMPDSEL2 is set to a logic zero state to select CMPDB which 
has, for example, one of its bits set to mis-match a corresponding data bit stored in each 
of the storage circuits in the timing circuits. In the following second cycle of the cycle 
clock, CMPDSELi is set to a logic zero state to select CMPDB and CMPDSEL2 is set to 
a logic one state to select CMPDA, and the sense circuit of timing circuit 1 lOOi asserts 
the signal on signal line 1 1 lOi when a mis-match is detected (and deasserts the signal on 
signal line 1 1 lOi before the next mis-match result is determined by 1 IOO2). In the 
following third cycle of the cycle clock, CMPDSELi is again set to a logic one state to 
select CMPDA, and CMPDSEL2 is again set to a logic one state to select CMPDB, and 
the sense circuit of timing circuit 1 IOO2 asserts the signal on signal line 1 1 IO2 when a 
mis-match is detected (and deasserts the signal on signal line 1 1 IO2 before the next mis- 
match result is determined by 1 lOOi). In this manner, comparison timing circuit 1300 of 
Figure 14 outputs a worst-case mis-match comparison timing signal on signal line 1304 
every cycle of the cycle clock for use in latching match results from CAM array 240 of 
Figure 2. 

[0073] Figure 15 illustrates the operation of comparison timing generator 1300 of 
Figure 13 using three comparison timing generator circuits 1 100. Consider, for this 
example, that comparison timing circuit 1300 operates as match comparison timing 
generator circuit 1001 of Figure 10. In a first cycle of the cycle clock, CMPDSELi and 
CMPDSEL2 are each set to a logic zero state to select CMPDA which has all of its bits 
set to mis-match the data stored in each of the storage circuits in the timing circuits 1 100, 
and CMPDSEL3 is set to a logic zero state to select CMPDB which has all of its bits set 
to match the data stored in each of the storage circuits in the timing circuits. In the 
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following second cycle of the cycle clock, CMPDSEL2 and CMPDSEL3 are each set to a 
logic zero state to select CMPDB, and CMPDSEL, is set to a logic one state to select 
CMPDA, and the sense circuit of timing circuit 1 lOOi asserts the signal on signal line 
1 1 lOi when a match is detected (and deasserts the signal on signal line 1 1 lOi before the 
next match result is determined by 1 IOO2). In the following third cycle of the cycle clock, 
CMPDSELi and CMPDSEL3 are each set to a logic zero state to select CMPDB, and 
CMPDSEL2 is set to a logic one state to select CMPDA, and the sense circuit of timing 
circuit 1 IOO2 asserts the signal on signal line 1 1 IO2 when a match is detected (and 
deasserts the signal on signal line 1 IIO2 before the next match result is determined by 
1 IOO3). In the following fourth cycle of the cycle clock, CMPDSELi and CMPDSEL2 are 
each again set to a logic zero state to select CMPDB, and CMPDSEL3 is again set to a 
logic one state to select CMPDA, and the sense circuit of timing circuit 1 IOO3 asserts the 
signal on signal line 1 1 IO3 when a match is detected (and deasserts the signal on signal 
line 1 1 IO3 before the next match result is determined by 1 lOOi). In this manner, 
comparison timing circuit 1300 of Figure 15 outputs a worst-case match comparison 
timing signal on signal line 1304 every cycle of the cycle clock for use in latching match 
results from CAM array 240 of Figure 2. 

[0074] Figure 15 illustrates the operation of comparison timing generator 1 300 of 
Figure 13 using three comparison timing generator circuits 1 100. Consider, for this 
example, that comparison timing circuit 1300 operates as a mis-match comparison timing 
generator circuit 1006 of Figure 10. In a first cycle of the cycle clock, CMPDSELi and 
CMPDSEL2 are each set to a logic one state to select CMPDA which has all of its bits set 
to match the data stored in each of the storage circuits in the timing circuits 1 100, and 
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CMPDSELs is set to a logic zero state to select CMPDB which has, for example, a single 
one of its bits set to mis-match a corresponding data bit stored in each of the storage 
circuits in the timing circuits. In the following second cycle of the cycle clock, 
CMPDSEL2 and CMPDSEL3 are each set to a logic zero state to select CMPDB, and 
CMPDSELi is set to a logic one state to select CMPDA, and the sense circuit of timing 
circuit 1 lOOi asserts the signal on signal line 1 1 lOi when a mis-match is detected (and 
deasserts the signal on signal line 1 1 lOi before the next mis-match result is determined 
by 1 IOO2). In the following third cycle of the cycle clock, CMPDSELi and CMPDSEL3 
are each set to a logic zero state to select CMPDB, and CMPDSEL2 is set to a logic one 
state to select CMPDA, and the sense circuit of timing circuit 1 IOO2 asserts the signal on 
signal line 1 1 IO2 when a mis-match is detected (and deasserts the signal on signal line 

1 1 102 before the next mis-match result is determmed by 1 IOO3). hi the following fourth 
cycle of the cycle clock, CMPDSELi and CMPDSEL2 are each again set to a logic zero 
state to select CMPDB, and CMPDSEL3 is again set to a logic one state to select 
CMPDA, and the sense circuit of timing circuit 1 IOO3 asserts the signal on signal line 

1 1 103 when a mis-match is detected (and deasserts the signal on signal line 1 1 IO3 before 
the next mis-match result is determined by 1 lOOi). hi this manner, comparison timing 
circuit 1300 of Figure 15 outputs a worst-case mis-match comparison timing signal on 
signal line 1304 every cycle of the cycle clock for use in latching match resuUs from 
CAM array 240 of Figure 2. 

[0075] Note that the match comparison timing circuit 1001 and the mis-match 
comparison timing circuit 1006 of Figure 10 may each have a different number of timing 
circuits 11 00. For example, the match comparison timing circuit may have two timing 
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circuits such as shown in Figure 14, and the mis-match comparison timing circuit may 
have three timing circuits such as shown in Figure 15. 

[0076] Figure 16 shows 16CAM apparatus or device 1600 that is one embodiment of 
a CAM device that includes CAM architecture 200 of Figure 2. The CAM apparatus may 
be formed, for example, as an integrated circuit on a semiconductor substrate. 
[00771 CAM device 1 600 includes addressing unit 1 6 1 0 that is capable of addressing 
one or more rows of CAM cells in CAM array 240 (or one or more individual CAM 
cells). Addressing unit 1610 may include, for example, a decoder that is coupled to word 
lines in the CAM array. The addresses are provided along an address bus 1617. 
[00781 The Read/Write unit 161 1 is capable of reading and writing data words 
from/to one or more rows of CAM cells (or one or more individual CAM cells) in CAM 
array 240. For example, Read/Write unit 161 1 may include one or more sense amplifiers 
for sensing and amphfying data stored in CAM array 240 during a read operation, and 
one or more write drivers for driving data to one or more rows and/or cells in CAM array 
240. The data words are provided along a data bus 1618. An instruction decoder unit 
1619 interprets instructions (e.g., write, read, compare) that are presented along an 
instruction bus 1621. The instruction decoder unit 1619 can also control the timing of 
any read, write or compare operations. If comparands used by the dynamic timing 
generator 220 are held by registers (not shown) associated with the dynamic timing 
generator 220, these comparands may be written/read to/fi-om tiieir registers via, for 
example, the data bus 1618. Here, the instruction decoder 1619 may be tailored to access 
these registers. Additionally, instruction decoder may provide one or more control signals 
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or timing signals to one or more of the other circuit blocks including provide CMPDSEL 
signals to dynamic timing generator 220. 

[0079] Note that one or more of the address bus, data bus, instruction bus or bus 
carrying the comparand data may time multiplexed onto one or more of the other buses. 
[0080] CAM array 240 may also be formed from several CAM array blocks each 
having their own address space as a subset of the entire address space for the CAM 
device. CAM arrays are conventionally blocked out to reduce signal line RC delays and 
speed the operation of the device. Here, each CAM array block may have its own 
dedicated dynamic timing generator, or may share a dynamic timing generator with one 
of the other CAM array blocks. 

[0081] Figure 17 illustrates one embodiment 1700 of an application (e.g., a 
networking application such as a switching card for a switch, a routing card for a router, 
or a line card for either or both a switch and/or a router) that can perform a look-up by 
using CAM device 1600 of Figure 16. A card is board-like structure having a plurality 
of semiconductor chips and the interconnections between them. Typically, the 
semiconductor chips are mounted to the card (e.g., by being soldered to the card or by 
being "plugged into" a socket that has been soldered to the card). The card embodiment 
of Figure 17 includes a processor 1710, ingress circuitry 1730, egress circuitry 1740, 
CAM 1600, associated data storage xmit 1770, traffic manager 1760, and payload storage 
unit 1750. 

[0082] Processor 1710 functions to control the overall operation of the appHcation. 
For example, processor 1710 receives packets from a network medium through ingress 
circuitry 1730, stores the payload of packets in payload storage unit 1750, and processes 
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packet header information to determine look-ups in CAM device 1600 and subsequent 
handling of the packets, as discussed herein. Ingress circuitry includes, for example, 
PHY and MAC devices. Processor 1710 sends out packets on a network medium through 
egress interface circuitry 1740 based on the lookups performed by CAM device 1600. 
Egress interface circuitry 1740 may be connected to a switch fabric or directly to one or 
more other routers or switches. Processor 1710 may be one or more network processor 
units (NPUs), microprocessors, or one or more special purpose processors such as a 
digital signal processor (DSP). In another embodiment, processor 1710 may be another 
type of controller, for example, a field programmable gate array or a general purpose 
processor. The processor 1710, ingress interface circuitry 1730, and egress interface 
circuitry 1740 components of a router are known in the art; accordingly, a detailed 
discussion is not provided. 

[0083] In response to information in a packet header, for a particular packet, 
processor 1710 determines the number and types of lookups to be performed by one or 
more of CAM devices 1600, and forms the search keys for these lookups. The searches 
or lookups may include, for example, Classification lookups, forwarding lookups (e.g.. 
Next Hop or longest prefix match (LPM) lookup, MAC lookup, MPLS lookup, etc.) 
When multiple searches are required, processor 1710 may form a composite search key 
that includes at least two, and as many as all, of the various search keys for the lookups. 
The composite search key may then be provided as a common input string to CAM 
device 1600. CAM device 1600 may selectively identify and extracts the individual 
search keys firom the common input string and provides the individual search keys to 
associated CAM array blocks in CAM device 1600 to perform the lookups. 
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Advantageously, the lookups can then occur concurrently or simultaneously in the CAM 
array blocks of CAM device 1600, thereby increasing overall throughput over 
conventional systems in which searches are processed sequentially. 
[00841 CAM device 1600 may be a multiple block CAM device with each block 
capable of storing a different table for comparand lookups. Alternatively, CAM device 
1600 may represent multiple, single block CAM devices (e.g., with each single block 
CAM device formed on a different integrated circuit substrate) with each CAM device 
used to store a different table for comparand lookup. After one or more lookups are 
executed in CAM device 1600, associated information for matching entries (e.g., 
additional routing information and/or packet information) may be retrieved from 
associated data unit 1770. Processor 1710 then communicates with traffic manager 1760 
to schedule the exit of a packet line card 1700 via egress interface circuitry 1740. 
[0085] Note also that embodiments of the present description may be implemented 
not only within a semiconductor chip but also within machine readable media. For 
example, the designs discussed above may be stored upon and/or embedded within 
machine readable media associated with a design tool used for designing semiconductor 
devices. Examples include a circuit description formatted in the VHSIC Hardware 
Description Language (VHDL) language, Verilog language or SPICE language. Some 
circuit description examples include: a behavioral level description, a register transfer 
level (RTL) description, a gate level netlist and a transistor level netlist. Machine 
readable media may also include media having layout information such as a GDS-II file. 
Furthermore, netlist files or other machine readable media for semiconductor chip design 
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may be used in a simulation environment to perform the methods of the teachings 
described above. 

[0086] Thus, it is also to be understood that embodiments of this invention may be 
used as or to support a software program executed upon some form of processing core 
(such as the Central Processing Unit (CPU) of a computer) or otherwise implemented or 
realized upon or within a machine readable medium. A machine readable medium 
includes any mechanism for storing or transmitting information in a form readable by a 
machine (e.g., a computer). For example, a machine readable medium includes read only 
memory (ROM); random access memory (RAM); magnetic disk storage media; optical 
storage media; flash memory devices; electrical, optical, acoustical or other form of 
propagated signals (e.g., carrier waves, infrared signals, digital signals, etc.); etc. 
[0087] As described above, dynamic timing generator 220 includes or is associated 
with circuitry that performs compare operations that parallel the operation of CAM array 
240 to generate a comparison timing signal that indicates the worst-case match and/or 
mismatch condition for CAM array 240. The comparison timing signal is used to trigger 
assertion of the latch enable signal, which in tum causes latch circuits 202i-202k to 
capture the match results generated by corresponding rows of CAM cells 240r240k. 
Thus, as the generation of match results is increased or decreased in time by the operation 
of the CAM array 240 under various operating conditions, dynamic timing generator 220 
similarly and proportionally dynamically adjusts the assertion of the latch enable signal 
so that the match resuUs are captured as soon as they are stable on the match Unes. In this 
manner, performance of CAM architecture 200 is dynamically optimized under various 
operating conditions. 
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[0088] In additional embodiments described below, additional consideration is given 
to the propagation delay of comparand data as it is transmitted from comparand register 
208 to the rows of CAM cells 240 relative to the propagation delay of the latch enable 
signal as it is transmitted from dynamic timing generator 220 to latch circuits 201 1 -202k 
via the latch enable signal line 203. 

[0089] Figure 18 shows a more detailed configuration 1800 of one embodiment of 
CAM architecture 200 of Figure 2. CAM architecture 1800 includes k rows of CAM 
cells 240i-240k, each coupled to a corresponding match line 210i-210k and to a 
corresponding word line 21 1 1-21 Ik. The CAM cells may be any type of CAM cell 
including binary, temary, or quaternary CAM cells having one or more storage elements 
and one or more compare circuits. The word lines are driven by a well-known address 
decoder 250 in response to an address ADDR, which can be provided by an address bus, 
as a next free address (NFA) by priority encoder 250, or by another suitable circuit. In 
some embodiments, multiple word lines may be used for each row of CAM cells. Match 
lines 210i-210k are coupled to the data inputs of corresponding latch circuits 202i-202k, 
each of which includes an enable input to receive the latch enable signal from dynamic 
timing generator 220 via signal line 203, and includes an output to provide a latched 
match signal on a corresponding signal line 204r204k to, for example, priority encoder 
205. In some embodiments, multiple match lines and multiple latch circuits may be used 
for each row of CAM cells. 

[0090] Each column of CAM cells 241 is coupled to a pair of bit lines BL and BL 
and a pair of comparand signal lines CL and CL . The bit lines BL and BL carry 
complementary data to/from the CAM cells 241 and are coupled to a read/write circuit 
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260, which includes well-known sense amplifies and write drivers to facilitate read and 
write operations to CAM cells 241. Read/write circuit 260 is coupled to an optional data 
bus DBUS. In alternative embodiments, only a single bit line may be used to 
communicate data with the CAM cells 241. The comparand lines CL and CL 
commxinicate complementary comparand data stored in comparand register 208 with the 
CAM cells 241. In altemative embodiments, only a single comparand line may be used to 
communicate comparand data with the CAM cells 241 or one or more of the comparand 
signals may be shared on one or more of the bit lines. 

[0091] During compare operations, comparand register 208 transmits comparand data 
to columns of CAM cells 241 via corresponding comparand lines CL and CL, and 
dynamic timing generator 220 transmits the latch enable signal to latch circuits 202i-202k 
via latch enable signal line 203 as described above. Because of propagation delays 
associated with CL and CZ, each row of CAM cells 240 receives the comparand data 
slightly later than the previous row and slightly before the next row. As a result, rows of 
CAM cells 240 positioned further fi-om comparand register 208 commence compare 
operations later than rows of CAM cells 240 positioned closer to comparand register 208. 
Accordingly, because of propagation delays associated with CL and CL , the match 
results for rows of CAM cells 240 positioned further firom comparand register 208 
generally become valid later than the match results for rows of CAM cells 240 positioned 
closer to comparand register 208. Similarly, because of propagation delays associated 
with signal line 203, latch circuits 202 positioned further firom timing generator 220 
receive the latch enable signal, and thus capture match results from corresponding match 
lines 210, later than latch circuits 202 positioned closer to timing generator 220. 
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[0092] If the comparand lines have different loading (e.g., resistance and capacitance 
of the signal line and circuits coupled thereto) than signal line 203 (e.g., because the 

circuits coupled to CL and CL are different than those coupled to signal line 203), then it 
is possible that the latch enable signal will cause one or more of the latch enable circuits 
to latch match results before valid match results are valid. 

[0093] As indicated above, a delay circuit can be applied to the output of the dynamic 
timing generator to delay when the latch enable signal reaches the latches. Altematively, 
the operation of dynamic timing generator 220 can be initially delayed for a given 
compare operation such that it generates its latch enable signal later in time than if it were 
not so initially delayed. For another embodiment, loading on latch enable signal line 203 
can be modeled after the loading on comparand line pair CL/ CL so that the propagation 
delay associated with transmitting comparand data to successive rows of CAM cells 

240i-240k along CUCL substantially matches or is less than the propagation delay 
associated with transmitting the latch enable signal to successive latch circuits 202i-202k 
along signal line 203. 

[0094] Figure 19 shows a CAM architecture 1900 in accordance with one 
embodiment of the present invention. CAM architecture 1900 includes all the elements 
of CAM architecture 1800 of Figure 18, with the addition of a plurality of load elements 
1902rl902k coupled to latch enable signal line 203. As shown m Figure 19, there is one 
load element 1902i- 1902k associated with each corresponding latch circuit 202 1 -202k, 
however, for other embodiments, fewer load elements may be used. Load 
elementsl902i-1902k, together with latch circuits 202i-202k, model the load on the 
comparand lines such that the total load on latch enable signal line 203 is greater than or 
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equal to that of the comparand lines. As a result, as match results become valid on match 
lines 210, the latch enable signal on signal line 203 will cause latches 202 to capture the 
correct match results for a given compare operation. 

[0095] For example, if propagation delays associated with CL and CL result in the 
comparand data reaching the last row of CAM cells 240k at a time t_ delay after the 
comparand data reaches the first row of CAM cells 240i, then load elements 1902r 1902k 
can be configured to suitably load signal line 203 such that the asserted state (e.g., the 
rising edge) of the latch enable signal reaches the last latch circuit 202k at least t_delay 
after it reaches the first latch circuit 202i. In one embodiment, load elements 1902i-1902k 
can be configured to suitably load signal line 203 such that the propagation delay on the 
comparand lines approximately equals the propagation delay on the latch enable signal 
line. In this manner, assertion of the latch enable signal can be timed to coincide with the 
match resuhs for the first row of CAM cells 240i becoming vaUd to optimize 
performance. 

[00961 Although the embodiment of Figure 19 is shown to include dynamic timing 
generator 220, for other embodiments of CAM architecture 1900, dynamic timing 
generator 220 can be replaced with a static timing generator such as, for example, timing 
generator 103 of Figure 1 (e.g., a pulse circuit or a delay circuit). 
[0097] Load elements 1902rl902k can be any suitable type of load circuit that 
increases the propagation delay associated with latch enable signal line 203. For some 
embodiments, each load element 1902 models the capacitive load on CL and CL 
resulting from the CAM cell 241 in a corresponding row of the CAM array. For 
example, each load circuit may be another CAM cell such as CAM cells 241, or only the 



36 



002489.P039X 



portions of a CAM cell 241 that load the comparand signal line. Figure 20 shows an 
NMOS transistor 2000 that is one embodiment of load elements 1902 of Figure 19, 
although other suitable capacitive elements can be used. Transistor 2000, which has a 
gate (G) coupled to signal line 203 and its source (S) and drain (D) terminals coupled to 
ground, is configured as a capacitor that models the capacitive load on a comparand line 
pair CL and CL resulting fi-om the CAM cell 240 in the corresponding row of CAM 
array. Thus, the capacitive load of transistor 2000 depends upon the configuration of the 
compare circuit used in CAM cells 240 and may either approximately model the 
capacitive load of the compare circuit or model such capacitive load in conjunction with 
the capacitive load provided by the corresponding latch 202. The gate capacitance of load 
transistor 2000 can be adjusted by manipulating its device geometries and characteristics 
(e.g., by manipulating its channel length/width, its gate oxide type and thickness, its 
doping concentration, and the like). 

[0098] The embodiment of CAM architecture 1900 is illustrated in Figure 19 and 
described above as having timing generator 220 located at the top of the array of CAM 
cells along with comparand register 208 such that the comparand data and the latch 
enable signal propagate in the same direction down the CAM array (e.g., fi-om the first 
row of CAM cells 240i to the last row of CAM cells 240k) during compare operations. 
As described above, for some embodiments, timing generator 220 can include one or 
more repUca rows of CAM cells that model compare operations of CAM cell rows 240 to 
generate a comparison timing signal that triggers assertion of the latch enable signal. For 
other embodiments, dynamic timing generator 220 may be located on the opposite end of 
the CAM array from the comparand register, for example, as shown in Figure 21. 
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[0099] Figure 21 shows a CAM architecture 2100 in accordance with another 
embodiment of the present invention. CAM architecture 2100 includes all the elements 
of CAM architecture 1800 of Figure 18, with the addition of signal generator 2102 and a 
plurality of load elements 2104i-2104k coupled to signal line 2103. As shown in Figure 
21, there is one load element 2104i-2104k associated with each corresponding CAM row 
240i-240ic, however, for other embodiments, fewer load elements may be used. Load 
elements 2104i-2104k model the load on the comparand lines such that the total load on 
signal line 2103 is greater than or equal to that of the comparand lines. Load elements 
2104i-2104kmay be, for example, the load elements 2 104i -2 104k described above with 
respect to Figure 19 and Figure 20. For one embodiment, an unused column of CAM 
cells (e.g., a column of CAM cells that do not affect the match results for the 
corresponding rows of CAM cells) may be used as load elements 2104i-2104k, and one 
of its comparand signal lines may be used for signal line 2103. The signal line 2103 may 
be fabricated in the same manner that other column signal lines 2103 are fabricated (e.g., 
using the same conductive layer(s), materials and geometries). For an altemative 
embodiment, signal line 2103 can be fabricated to have the desired loading without using 
load elements 2104i-2104k, but adjusting the conductive properties (e.g., adjusting one or 
more of the conductive material(s) comprising the signal Une, geometries, number of 
fabrication process layers, etc.) of the signal line 2103 to effect the appropriate loading. 
[00100] Signal generator 2102 asserts an enable signal on signal line 2103 at 
approximately the same time that comparand register 208 outputs comparand data on the 

comparand signal lines CL and CL . When dynamic timing generator 220 receives the 
asserted enable signal on signal line 2103, dynamic timing generator 220 outputs its latch 
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enable signal on signal line 203. For an alternative embodiment, the enable signal on 
signal line 2103 enables dynamic timing generator to start its operations. 
[00101] Since dynamic timing generator 220 will not assert the latch enable signal on 
signal line 203 until after the entire CAM array has received the comparand data and has 
commenced its compare operations, the latch enable signal on signal line 203 will cause 
latches 202 to capture the correct match resuUs for a given compare operation. 
[00102] Signal generator 2102 can output and assert the enable signal on signal line 
2103, for example, in response to the cycle clock. Signal generator 2102 may be any 
suitable signal generator circuit. For one embodiment, signal 2102 is a one-shot circuit. 
For other embodiments, signal generator 2102 may be part of comparand register 208 
(e.g., a bit stored in comparand register 208) or may be the cycle clock or a derivative 
thereof. 

[001 03] For other embodiments of CAM architecture 2 1 00, signal generator 2 1 03 may 
assert the enable signal on signal Une 2103 after the comparand data is driven onto the 
comparand lines. In this embodiment, the load elements 2104i-2104kmay be designed to 
provide at least the same load as that on the comparand signal lines, or load elements 
2104i-2104kmay provide more or greater delay to the enable signal on signal line 2103. 
[00104] While Figure 21 shows that dynamic timing generator 220 may be disposed at 
the opposite end of the CAM array from comparand register 208, in other embodiments, 
dynamic timing generator 220 may be disposed at other location relative to comparand 
register 208. For one embodiment, multiple dynamic timing generators may be used. For 
one embodiment, multiple dynamic timing generators may be coupled to signal line 2103 
and each transmitting a local latch enable signal to a subset of the latches 202 1 -202k. For 
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one example, a first dynamic timing generator may be coupled to the midpoint of signal 
line 2103 and transmit a local latch enable signal to latches 202i-202k/2, and a second 
dynamic timing generator may be coupled to the end of signal line 2103 and transmit a 
local latch enable signal to latches 202k/2+i-202k. In other examples, any number of 
dynamic timing generators and local latch enable signals may be used. 
[00105] For another embodiment, comparand register 208 may be disposed between 
segments or subarrays 2202a and 2202b of the CAM array and multiple dynamic timing 
generator circuits 220a and 220b niay be used, as shown, for example, in CAM 
architecture 2200 of Figure 22, to reduce the propagation delay times on the comparand 
lines and latch match resuUs earlier in time than that shown in Figure 21. CAM subarray 
2202a includes the first X rows of the CAM array 240i-240x, and CAM subarray 220B 
includes the balance of the K rows of CAM array 240x+r240K. While X may be one half 
of K, in other embodiments, X may be any number and the comparand register disposed 
between rows 240x and 240x+i. 

[00106] Dynamic timing generator 220a is disposed at the top of subarray 2202a and 
receives an enable signal firom signal generator 2102 over signal line 2103a, which is 
loaded by load elements 2104i-2104x. Similarly, dynamic timing generator 220b is 
disposed at the bottom of subarray 2202b and receives an enable signal firom signal 
generator 2102 over signal line 2103b, which is loaded by load elements 2104x+r2104K. 
As in Figure 21, CAM architecture 2200 includes one load element 2104] -2 104k 
associated with each corresponding CAM row 240 1 -240k, however, for other 
embodiments, fewer load elements may be used. Load elements 2104i-2104k model the 
load on the comparand Unes associated with the corresponding subarray such that the 
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total load on the respective signal lines 2103a and 2103b is greater than or equal to that of 
the corresponding comparand lines. Load elements 2104i-2104kmay be, for example, the 
load elements 2104r2104k described above with respect to Figure 19 and Figure 20. 
[00107] Signal generator 2102 asserts enable signals on signal lines 2103a and 2103b 
at approximately the same time that comparand register 208 outputs comparand data on 
the comparand signal lines CL and CL to each of the subarrays. When the dynamic 
timing generator 220a and 220b receive their asserted enable signals on signal lines 
2103a and 2103b respectively, dynamic timing generator 220a outputs its latch enable 
signal on signal line 203a to latches 202i-202x and dynamic timing generator 220b 
outputs its latch enable signal on signal line 203b to latches 202x+i-202ic. For an 
alternative embodiment, the enable signals on signal lines 2103a and 2103b enable the 
dynamic timing generators to start their operations. Since the dynamic timing generators 
will not assert their latch enable signals until after their respective subarrays have 
received their comparand data and has commenced their compare operations, the latch 
enable signals on signal lines 203a and 203b will cause latches 202i-202k to capture the 
correct match results for a given compare operation. 

[00108] For other embodiments, the CAM array may be further segmented into any 
number of subarrays having comparand registers disposed therebetween and associated 
dynamic timing generators associated therewith. 

[00109] In the foregoing specification, the invention has been described with reference 
to specific exemplary embodiments thereof It will, however, be evident that various 
modifications and changes may be made thereto without departing firom the broader spirit 
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and scope of the invention as set forth in the appended claims. The specification and 
drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense. 
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